Predicting Brain Morphogenesis via Physics-Transfer Learning

Zhao, Yingjie, Song, Yicheng, Xu, Fan, Xu, Zhiping

arXiv.org Artificial Intelligence

Brain morphology is shaped by genetic and mechanical factors and is linked to biological development and diseases. Its fractal-like features, regional anisotropy, and complex curvature distributions hinder quantitative insights in medical inspections. Recognizing that the underlying elastic instability and bifurcation share the same physics as simple geometries such as spheres and ellipses, we developed a physics-transfer learning framework to address the geometrical complexity. To overcome the challenge of data scarcity, we constructed a digital library of high-fidelity continuum mechanics modeling that both describes and predicts the developmental processes of brain growth and disease. The physics of nonlinear elasticity from simple geometries is embedded into a neural network and applied to brain models. This physics-transfer approach demonstrates remarkable performance in feature characterization and morphogenesis prediction, highlighting the pivotal role of localized deformation in dominating over the background geometry. The data-driven framework also provides a library of reduced-dimensional evolutionary representations that capture the essential physics of the highly folded cerebral cortex. Validation through medical images and domain expertise underscores the deployment of digital-twin technology in comprehending the morphological complexity of the brain.


MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts

Piryani, Bhawna, Mozafari, Jamshid, Abdallah, Abdelrahman, Doucet, Antoine, Jatowt, Adam

arXiv.org Artificial Intelligence

Optical Character Recognition (OCR) plays a crucial role in digitizing historical and multilingual documents, yet OCR errors -- imperfect extraction of the text, including character insertion, deletion and permutation -- can significantly impact downstream tasks like question-answering (QA). In this work, we introduce a multilingual QA dataset MultiOCR-QA, designed to analyze the effects of OCR noise on QA systems' performance. The MultiOCR-QA dataset comprises 60K question-answer pairs covering three languages: English, French, and German. The dataset is curated from OCR-ed old documents, allowing for the evaluation of OCR-induced challenges on question answering. We evaluate MultiOCR-QA on various levels and types of OCR errors to assess the robustness of LLMs in handling real-world digitization errors. Our findings show that QA systems are highly prone to OCR-induced errors and exhibit performance degradation on noisy OCR text.
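The character-level error types named in the abstract (insertion, deletion, and character corruption) can be simulated with a small noise injector. The sketch below is purely illustrative and is not the noise model used to build MultiOCR-QA; real OCR noise follows character confusion statistics rather than uniform randomness:

```python
import random

def inject_ocr_noise(text: str, error_rate: float = 0.05, seed: int = 0) -> str:
    """Simulate character-level OCR errors: deletions, substitutions,
    and insertions, applied independently per character."""
    rng = random.Random(seed)
    letters = "abcdefghijklmnopqrstuvwxyz"
    out = []
    for ch in text:
        r = rng.random()
        if r < error_rate / 3:            # deletion: drop the character
            continue
        elif r < 2 * error_rate / 3:      # substitution: replace with a random letter
            out.append(rng.choice(letters))
        elif r < error_rate:              # insertion: keep char, add a stray one
            out.append(ch)
            out.append(rng.choice(letters))
        else:                             # character survives intact
            out.append(ch)
    return "".join(out)

clean = "The quick brown fox jumps over the lazy dog."
noisy = inject_ocr_noise(clean, error_rate=0.15)
```

Sweeping `error_rate` produces a family of progressively noisier corpora, which is one simple way to probe QA robustness at "various levels" of degradation.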


Neural Network Modeling of Microstructure Complexity Using Digital Libraries

Zhao, Yingjie, Xu, Zhiping

arXiv.org Artificial Intelligence

Microstructure evolution in matter is often modeled numerically using field or level-set solvers, mirroring the dual representation of spatiotemporal complexity in terms of pixel or voxel data, and geometrical forms in vector graphics. Motivated by this analog, as well as the structural and event-driven nature of artificial and spiking neural networks, respectively, we evaluate their performance in learning and predicting fatigue crack growth and Turing pattern development. Predictions are made based on digital libraries constructed from computer simulations, which can be replaced by experimental data to lift the mathematical overconstraints of physics. Our assessment suggests that the leaky integrate-and-fire neuron model offers superior predictive accuracy with fewer parameters and less memory usage, alleviating the accuracy-cost tradeoff in contrast to the common practices in computer vision tasks. Examination of network architectures shows that these benefits arise from its reduced weight range and sparser connections. The study highlights the capability of event-driven models in tackling problems with evolutionary bulk-phase and interface behaviors using the digital library approach.
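The leaky integrate-and-fire neuron mentioned above can be stated in a few lines: the membrane potential decays toward rest with time constant tau, integrates input current, and emits a spike when it crosses a threshold, after which it resets. This is a generic Euler-discretized sketch with illustrative parameters, not the specific configuration used in the paper:

```python
def lif_simulate(inputs, tau=20.0, v_th=1.0, v_reset=0.0, dt=1.0):
    """Leaky integrate-and-fire neuron: returns a 0/1 spike train,
    one entry per input time step."""
    v = v_reset
    spikes = []
    for i in inputs:
        # Euler step of the membrane equation dv/dt = (-v + i) / tau
        v += dt * (-v + i) / tau
        if v >= v_th:       # threshold crossed: fire and reset
            spikes.append(1)
            v = v_reset
        else:
            spikes.append(0)
    return spikes

# Strong constant drive fires on every step; zero drive never fires.
dense = lif_simulate([2.0] * 10, tau=2.0)
silent = lif_simulate([0.0] * 10)
```

The event-driven character the paper highlights comes from this thresholding: between spikes the neuron carries only a single scalar state, which is one intuition for the reduced memory footprint relative to dense activations.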


Dynamic faceted search: from haystack to highlight

AIHub

In the digital age, the number of scholarly articles is growing exponentially. In the Open Research Knowledge Graph's question-answering facility ASK, for example, more than 80 million research articles have already been indexed. Finding the most relevant information in vast collections of scholarly data can be daunting for researchers, students, and academics. To tackle this challenge, search engines and digital libraries often rely on advanced search techniques, one of the most effective being faceted search. Faceted search is an advanced search method that allows users to filter and refine search results based on multiple predefined attributes, known as facets.
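At its core, faceted filtering as described above reduces to conjunctive attribute matching, usually paired with per-facet counts so the interface can show how many results each refinement would leave. A minimal sketch (the record fields are invented for illustration and do not reflect ASK's actual schema):

```python
from collections import Counter

def faceted_search(records, **facets):
    """Keep only records matching every requested facet exactly."""
    return [r for r in records
            if all(r.get(k) == v for k, v in facets.items())]

def facet_counts(records, facet):
    """Count how many records fall under each value of one facet."""
    return Counter(r[facet] for r in records)

papers = [
    {"title": "A", "year": 2023, "field": "NLP"},
    {"title": "B", "year": 2023, "field": "Vision"},
    {"title": "C", "year": 2022, "field": "NLP"},
]

hits = faceted_search(papers, year=2023, field="NLP")   # only paper "A"
counts = facet_counts(papers, "field")                  # {"NLP": 2, "Vision": 1}
```

Production systems compute these counts in the search index (e.g., as inverted-index aggregations) rather than by scanning records, but the filtering semantics are the same.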


A Library Perspective on Supervised Text Processing in Digital Libraries: An Investigation in the Biomedical Domain

Kroll, Hermann, Sackhoff, Pascal, Thang, Bill Matthias, Ksouri, Maha, Balke, Wolf-Tilo

arXiv.org Artificial Intelligence

Digital libraries that maintain extensive textual collections may want to further enrich their content for certain downstream applications, e.g., building knowledge graphs, semantic enrichment of the documents, or implementing novel access paths. All of these applications require some text processing, either to identify relevant entities, extract semantic relationships between them, or to classify documents into some categories. However, implementing reliable, supervised workflows can become quite challenging for a digital library because suitable training data must be crafted, and reliable models must be trained. While many works focus on achieving the highest accuracy on some benchmarks, we tackle the problem from the perspective of a digital library practitioner. In other words, we also consider tradeoffs between accuracy and application costs, and dive into training data.

One way to explore a digital library's content is to apply natural language processing methods, e.g., identify central entities (e.g., the Person Albert Einstein), their relationships (e.g., Albert Einstein was born in Ulm), and classify documents as belonging to classes (e.g., descriptive articles). The extraction of semantic relationships between named entities is already used in several digital library projects for different purposes, e.g., constructing a biomedical knowledge graph from scientific papers like SemMedDB [18], harvesting leader boards of how computer science methods perform on benchmarks [17], harvesting scientific information as done in SciGraph [44], enabling graph-based discovery systems in digital libraries [20], or enriching library content like newspapers as done in the Swiss-Luxembourgish impresso [10].


STONYBOOK: A System and Resource for Large-Scale Analysis of Novels

Pethe, Charuta, Kim, Allen, Prabhakar, Rajesh, Pial, Tanzir, Skiena, Steven

arXiv.org Artificial Intelligence

Books have historically been the primary mechanism through which narratives are transmitted. We have developed a collection of resources for the large-scale analysis of novels, including: (1) an open source end-to-end NLP analysis pipeline for the annotation of novels into a standard XML format, (2) a collection of 49,207 distinct cleaned and annotated novels, and (3) a database with an associated web interface for the large-scale aggregate analysis of these literary works. We describe the major functionalities provided in the annotation system along with their utilities. We present samples of analysis artifacts from our website, such as visualizations of character occurrences and interactions, similar books, representative vocabulary, part of speech statistics, and readability metrics. We also describe the use of the annotated format in qualitative and quantitative analysis across large corpora of novels.


Topological Data Analysis in smart manufacturing processes -- A survey on the state of the art

Uray, Martin, Giunti, Barbara, Kerber, Michael, Huber, Stefan

arXiv.org Artificial Intelligence

Topological Data Analysis (TDA) is a mathematical method using techniques from topology for the analysis of complex, multi-dimensional data that has been widely and successfully applied in several fields such as medicine, material science, biology, and others. This survey summarizes the state of the art of TDA in yet another application area: industrial manufacturing and production in the context of Industry 4.0. We perform a rigorous and reproducible literature search of applications of TDA on the setting of industrial production and manufacturing. The resulting works are clustered and analyzed based on their application area within the manufacturing process and their input data type. We highlight the key benefits of TDA and their tools in this area and describe its challenges, as well as future potential. Finally, we discuss which TDA methods are underutilized in (the specific area of) industry and the identified types of application, with the goal of prompting more research in this profitable area of application.


ACM: Digital Library: Communications of the ACM

#artificialintelligence

Forecasting rates of sea level change in polar ice shelves: Polar scientists, along with atmospheric and ocean scientists, face an urgent need to understand sea level rise around the globe. Ice-shelf environments represent extreme environments for sampling and sensing. Current efforts to collect sensed data are limited and use tethered robots with traditional sampling frequency and collection limitations. The ability to collect extensive data about conditions at or near the ice shelves will inform our understanding about changes in ocean circulation patterns, as well as feedbacks with wind circulation. New research on intelligent sensors would support selective data collection, onboard data analysis, and adaptive sensor steering.


A Bayesian Learning, Greedy agglomerative clustering approach and evaluation techniques for Author Name Disambiguation Problem

Sourav, Shashwat

arXiv.org Artificial Intelligence

Author names often suffer from ambiguity owing to the same author appearing under different names and multiple authors possessing similar names. This creates difficulty in associating a scholarly work with the person who wrote it, thereby introducing inaccuracy in credit attribution, bibliometric analysis, search-by-author in a digital library, and expert discovery. A plethora of techniques for disambiguation of author names have been proposed in the literature. I focus on the research efforts targeted at disambiguating author names. I first go through the conventional methods, then I discuss evaluation techniques and the clustering model, which finally leads to the Bayesian learning and greedy agglomerative approach. I believe this concentrated review will be useful for the research community because it discusses techniques applied to a very large real database that is actively used worldwide. The Bayesian and greedy agglomerative approaches discussed will help to tackle author name disambiguation (AND) problems in a better way. Finally, I try to outline a few directions for future work.
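The greedy agglomerative idea can be sketched in a few lines: each author mention starts as its own cluster, and the most similar pair of clusters is merged repeatedly until no pair exceeds a threshold. The similarity signal below (Jaccard overlap of coauthor sets) is an assumed stand-in for illustration, not necessarily the evidence model used in the reviewed work:

```python
def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    return len(a & b) / len(a | b) if a | b else 0.0

def greedy_agglomerate(mentions, threshold=0.3):
    """Greedily merge author-mention clusters whose coauthor sets
    overlap above `threshold`; returns lists of mention indices."""
    clusters = [{"ids": [i], "coauthors": set(m)} for i, m in enumerate(mentions)]
    while True:
        best, pair = threshold, None
        # Find the single most similar cluster pair above threshold.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                s = jaccard(clusters[i]["coauthors"], clusters[j]["coauthors"])
                if s > best:
                    best, pair = s, (i, j)
        if pair is None:
            return [c["ids"] for c in clusters]
        i, j = pair
        clusters[i]["ids"] += clusters[j]["ids"]
        clusters[i]["coauthors"] |= clusters[j]["coauthors"]
        del clusters[j]

# Mentions 0 and 1 share coauthor "smith", so they merge into one identity.
result = greedy_agglomerate([{"smith", "lee"}, {"smith", "kim"}, {"garcia"}])
# → [[0, 1], [2]]
```

A Bayesian variant replaces the fixed threshold with a posterior odds test of "same author" versus "different authors" given the shared evidence, which is the combination the review builds toward.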


National Digital Library of India

Communications of the ACM

The National Digital Library of India was conceptualized with an aim to bring equity of access to educational resources for every Indian through a single window access mechanism.